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Abstract 

We construct a model of a lattice polymer which describes sec- 
ondary structures of proteins. In this model the energy of a confor- 
mation of a polymer is equal to a sum of energies of conformations of 
segments of the polymer chain of the length five. 

We show that for this model with cooperative interaction all con- 
formations with minimal energy are combinations of lattice models of 
alpha-helix and beta-strand. We show that for lattice polymers of the 
length not longer that 38 monomers we can describe all conformations 
with minimal energy. 

1 Introduction 

In the present paper we construct the model of lattice polymer (the quinary 
lattice model) where the minima of energy will have the form of lattice models 
of combinations of alpha-helices and beta-strands. The energy of a protein 
in this model will be equal to the sum over conformations of segments of 
a polymer of the length five. Therefore instead of consideration of the ex- 
plicit interaction between amino acids in a protein we consider a cooperative 
interaction in a polymer chain. 

Lattice models of polymers were extensively discussed in the literature, 
cf. [U El El H]. In El [7] design of sequences of polymers which fold to 
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given conformations was discussed. For the review of physics of proteins see 
[SI [H] - A protein globule was considered as a crumpled globule [TD], in [TT] 
the relation of this kind of globules and random hierarchical networks was 
discussed. 

The problem of protein folding, i.e. folding to the native of a protein 
instead of performing search over a vast space of conformations is well known. 
The approach of folding funnels based on the idea of cooperative (collective) 
interactions in proteins was discussed, see for example [T2] . 

In [13] the review of application of combinatorial algorithms to protein 
folding was given and investigation of complexity of these algorithms was 
discussed. 

In [T5] the GOR method of analysis of conformations of proteins was 
developed. In this approach the probability of an amino acid to belong to 
some secondary structure was considered as dependent on a short segment 
of the protein containing this amino acid. 

In (15] proteins were considered as combinations of short sequences (in 
particular 5-tuples) of amino acids, in [TB] the statistics of conformations of 
short segments of proteins was discussed. 

In [T7] native states of proteins were described with the help of combina- 
tions of solitons. 

The structure of the present paper is as follows. 

In section 2 we introduce the quinary lattice model of a polymer and 
show that the lattice models of alpha-helix and beta-strand are the energy 
minima for the introduced model. 

In section 3 we describe minima of energy of the proposed model and 
show that these conformations can be considered as combinations of lattice 
alpha-helices and beta-strands. 

In section 4 we consider the heteropolymer version of the introduced 
model and show that in this case we can describe polymers which possess a 
native tertiary structure — a conformation which minimizes the energy and 
is uniquely defined by the sequence of the polymer. The obtained tertiary 
structure is a combination of secondary structures described in sections 2 
and 3. 

In section 5 we discuss the model of energy of a lattice polymer which is 
a combination of the quinary lattice model and the standard model of the 
nearest (in the lattice) neighbors interaction of amino acids. 
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2 The quinary lattice model 



The standard model of energy of lattice polymers has the following form, see 
PQ, [2]. One considers a linear lattice polymer (a finite sequence of monomers 
connected by edges of the length one), the monomers are situated at vertices 
of the cubic lattice Z 3 . A conformation of a polymer of the length N is a 
sequence of neighbor vertices without self-intersections in the cubic lattice 
Z 3 , i.e. the injective map 

r:{l,...,iV}^Z 3 , (1) 

where neighbor natural numbers map to neighbor (i.e. distance one) vertices 
of the lattice Z 3 . In the following we will denote T also the image of this 
map. 

Energy of the conformation F in the standard model is proportional to 
the following sum 

e 2 (t) = - mmnm, (2) 

l<«<j<N 

where T(i) is the z-th monomer in the polymer, <f(-, •) is the distance in Z 3 , 
5(1) = 1, 5(i) — 0, i > 1. Non zero contributions to this sum come from the 
contacts of pairs of monomers in the conformation T. 

The above model takes into account only the number of contacts in the 
conformation but does not distinguish between the conformations with the 
different geometry. Therefore this model does not describe formation of sec- 
ondary structures (special preferred conformations of a protein), in particu- 
lar, alpha-helices and beta-strands. 

In the present paper we introduce the following model of lattice polymer. 
The energy of a polymer in this model will be equal to the sum of contribu- 
tions where each of the contributions depends on conformation of a segment 
of a lattice polymer of the length 5 (i.e. which contains five monomers). 
Therefore the energy of a polymer of the length N in conformation T has the 
form 

iV-2 

E 5 (r) = -j2m)- (3) 

i=3 

Here Tj is a conformation of the i-th 5-tuple of monomers in a polymer, i.e. 

r, = (r(i - 2), r(i - 1), r(i), r(i + 1), r(i + 2)), i = 3, . . . , n - 2, 
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Conformation 1 



Conformation 2 




Fig. 1 





where T(i) is the z'-th monomer in a polymer of the length N, $ is some func- 
tion of conformations of 5-tuples of monomers. The function $ is taken to be 
invariant with respect to lattice rotations and translations of conformations 
of 5-tuples. 

The intersection of the two neighbor 5-tuples Tj and r, + i contains the 
four monomers F(i — 1), T(i), T(i + 1), T(i + 2). 

It is easy the see that (modulo lattice translations and rotations) there 
exist 30 different conformations of 5-tuples without self-intersections. We 
choose the function $ as follows — $(T) is equal to zero for all conformations 
except the conformations denoted 1 and 2 for which $(1) = $(2) = 1, cf. 
Fig. 1 (by definition $ will be also equal to one for lattice translations and 
rotations of conformations 1 and 2). 

At Fig. 1 the edges connect neighbor monomers, thus a segment of a 
polymer containing five monomers will contain four edges. Conformations 1 
and 2 are segments of a right handed helix, see. Fig. 2. 

Proof of the following lemma is straightforward. 

Lemma 1 Conformation of any 5-tuple of neighbor monomers in periodic 
conformations of lattice polymers at Fig. 2 is either conformation 1 or con- 
formation 2 of Fig. 1 . Therefore these periodic conformations are minima of 
energy (0). 

The left conformation at Fig. 2 may be considered as a lattice model of 
a-helix, the right conformation can be considered as a model of /3-strand. 
Therefore in the model under consideration we observe the two most impor- 
tant examples of secondary structures. 

In real proteins /3-strands are stabilized by hydrogen bonds between par- 
allel chains, therefore to have a stable conformation one needs at least two 
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a-helix 



/3-strand 




parallel /?-strands. Our model neglects hydrogen bonds and considers con- 
formations which model folds of peptide chain in secondary structures as 
energetically profitable. Therefore this model can not pretend to give a re- 
alistic description of conformations of real proteins. Our aim is to describe 
qualitatively the effect of existence of secondary structures. 

In the next section we will show that the described lattice a and /3 con- 
formations and their combinations are all possible minima of energy for the 
quinary lattice polymer model (J3]). 
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3 Enumeration of minimal conformations 

Let us fix the directions in the 5-tuples of monomers at Fig. 1. We choose 
the beginnings and the ends of 5-tuples, see Fig. 3 (where B denotes the 
beginning and E denotes the end). The obtained conformations of directed 
polymers we denote 1 and 2 (the left and the right conformations at Fig. 3 
correspondingly). The same conformations but with the opposite directions 
(from the end to the beginning) we denote 1 and 2 . 

Conformations of lattice polymers which are minima of the energy ([3]), 
i.e. conformations of lattice polymers for^ which any 5-tuple of neighbor 
monomers has one of the conformations 1 , 2 , 1 , 2 are called minimal. 
Note that the neighbor 5-tuples of monomers in a polymer (or 4-tuples of 
edges shown at Fig. 3) have the intersection containing 4 monomers (or 3 
edges). We consider these conformations as models of secondary structures 
in proteins. 

A lattice polymer in a minimal conformation V generates a sequence 
r 3 r 4 . . . T at_2 of conformations of 5-tuples of monomers (when we read the 
sequence of monomers in the polymer from the beginning to the end), where 
Ei £ { 1 , 2, 1, 2}. The conformation T of a polymer can be restored from 
the sequence r 3 r 4 . . . of conformations of 5-tuples. 

We say that a minimal conformation r 3 r 4 . . . V m-2 is a continuation of 
a minimal conformation r 3 r 4 . . . T^_ 2 , N < M if the second sequence of 
symbols is a segment of the first sequence (i.e. the first sequence is obtained 
from the second by adding some symbols in the beginning and^the end). 

Which sequences r 3 r 4 . . . T N _ 2 of the conformations 1,2,1,2" can be 
generated by minimal conformations of a lattice polymer? It is not always 
possible to combine a couple of conformations 1 , 2 , 1 , 2 of 5-tuples into 
a single conformation of a 6-tuple due to geometric restrictions, see the next 
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lemma. 



Lemma 2 1) Possible pairs of neighbor conformations of 5-tuples in the 
sequence ^^...1^-2 related to some minimal conformation of a lattice 
polymer are described by the following table 
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(i.e. for any pair of symbols denoted by + in the table above there exists a 
minimal conformation of a lattice polymer of length 6). 

2) Any conformation described by a triple of symbols from { 1 , z , I , 
2 } permitted by the above table corresponds to some minimal conformation 
of a lattice polymer of length 7 except the triples 

Theorem H] below shows that the above lemma describes all possible geo- 
metric restrictions for lattice polymers of the length shorter or equal to 38, i.e. 
for any sequence of conformations of 5-tuples r 3 r 4 . . . N < 38, which 

satisfies the conditions of the above lemma [2], there exists the correspond- 
ing minimal conformation (without self-intersections) of a lattice polymer. 
Therefore in order to construct minimal conformations of sufficiently short 
lattice polymers it is sufficient to take into account the geometric restrictions 
for neighbor couples and triples of 5-tuples Tj. 

Let us consider the following periodic sequences of conformations of 5- 
tuples satisfying the conditions of lemma [2] 

(ttVT)..., (tt).... 

Here the periods are shown in brackets (i.e. one can iterate the sequence in 
brackets). The corresponding conformations of lattice polymers are shown 
at Fig. 4. 

The next lemma describes the sequences of conformations of 5-tuples for 
the lattice alpha and beta structures described in lemma [TJ 
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Fig. 4 



Lemma 3 Periodic sequences of conformations of 5-tuples with the periods 

a = (Wt), /? = (tt), 

see Fig. 4, correspond to conformations of a lattice polymer without self- 
intersections. Moreover these conformations are lattice a-helix and /3-strand 
(see Fig. 2) correspondingly. 

The next theorem describes all minimal conformations for the model (EJ). 
We show that all these conformations correspond to combinations of a and 
/3 structures and for polymers not longer than 38 all such combinations are 
possible (correspond to conformations without self-intersections). For longer 
polymers self-intersections are possible therefore only part of combinations of 
a and j3 structures correspond to minimal conformations of lattice polymers. 

Theorem 4 1) Any minimal conformation of a lattice polymer ([3]) with 
the length N > 6 has the following form: 

The corresponding sequence r 3 r 4 . . . of conformations of 5-tuples 

can be obtained from some sequence of a and (3 structures, a = 12 2 1, 
(3=11 by elimination of a finite number of symbols 1 , in the 

beginning and the end of the sequence. 

2) All conformations of a lattice polymer with the length 6 < N < 39 
obtained as above do not contain self intersections. 

There exists a sequence of conformations of 5-tuples corresponding to the 
conformation of a lattice polymer of the length 39 with self-intersections. 

Proof By the definition a minimal conformation of lattice polymer of the 
length iV generates a sequence r 3 r 4 . . . T N _ 2 of conformations of 5-tuples of 
monomers, Ti G { 1 , 1, 2 , 2}. 

Let us discuss how the conformations 2 (i.e. *2 or if) can be situated 
in this sequence. By lemma |5] we can not have more that two consecutive 
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Fig. 5 



symbols 2 in the sequence. A single conformation 2 can be situated either 
at the beginning or the end of the sequence. If the polymer is longer than 
six monomers the minimal conformation of this polymer can not contain a 
subsequence 2 2 (since this sequence can be continued only by 2 which is 
forbidden) . 

Therefore conformations 2 can be found inside a sequence corresponding 
to a minimal conformation only in pairs and for polymers longer than six 
monomers this pair has the form 2 2 . Any pair of this form by lemma 
[5] should be augmented by conformations 1 (again modulo the boundaries 
of the sequence), and the corresponding conformation (which corresponds 
to a segment of a lattice polymer of the length eight) will have the form 
a = "f* 2 *2 1~. 

The part of the sequence of conformations of 5-tuples of monomers which 
does not contain conformations 2 contains iterations of the conformation 
(3=11. This implies the first statement of the theorem. 

In order to prove the second statement of the theorem let us consider 
Fig. 10 with a lattice polymer of the length 38 in the minimal conformation 
afiaafiaafiaafi. One can see that any minimal conformation which is a 
continuation of this conformation will have self-intersections. 

We have already checked that iterations of a or f3 structures are minimal 
conformations without self-intersections. Let us prove that minimal confor- 
mations of sufficiently short lattice polymers containing a mixture of a and 
j3 structures also do not contain self-intersections. 

Let us put in correspondence to a and f3 structures their central lines 
as shown at Fig. 5 by dotted lines. For the a-structure the central line 
connects the centers of the opposite faces of the cube at Fig. 5 (edges of the 
a-structure will be the edges of this cube). Analogously for the /3-structure 
the central line will connects the adjacent faces of the corresponding cube. 
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Conformation a/3 



Conformation j3a 
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The different combinations of a and (3 structures can be found at the 
figures 6, 7, 8, 9. One can find that: 

1) Central line of a minimal conformation of a lattice polymer is a con- 
tinuous broken line; 

2) For the joint of two a structures or two /3 structures the central lines 
of joint structures will be parallel, therefore central lines of aa . . . and (3/3 . . . 
will be straight; 

3) For the joint of the different structures (a/3 or /3a) the central line 
breaks with the angle 135°. 

For obtaining a self-intersection of a minimal conformation of a lattice 
polymer we need several breaks of the central line at contacts of a and 
(3 structures. The central line have to rotate for more that 180° (i.e. we 
need more than four breaks). One can see that the shortest lattice polymer 
with minimal conformation with self-intersection will correspond to some 
continuation of the conformation at Fig. 10, for example (3/3aa(3aa/3aa/3 1 . 

Therefore any minimal conformation described at the statement of the 
theorem for a lattice polymer with the length not larger than 38 can be 
realized without self-intersections. 

This finishes the proof of the theorem. □ 

Remark Theorem H] shows that for a lattice polymer with the length larger 
than six monomers the minima of energy ([3]) have the form of combinations 
of lattice a and (3 structures, and for polymers not longer than 38 all such 
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combinations do not have self-intersections. Therefore collective interactions 
in models of lattice polymers are able to describe the formation of secondary 
structures in proteins. Arising of self-intersections for long lattice polymers 
is natural — for real proteins not all arbitrary combinations of a and j3 
structures are possible. 



4 Heteropolymers and native conformations 

In the present section we consider a heteropolymer analogue of the quinary 
model of lattice polymer fl3]). We consider a lattice polymer which consists 
of monomers of the two kinds A and B. This polymer has the sequence of 
monomers S and the conformation V. We define the energy of the polymer 
by the following modification of the formula ([3]): 

N-2 

e 5 ,(s, r) = - [#b($) ($i(r,) + e $ 2 (r,)) + ($ 2 (r.) + e^r,))] . 

i=3 

(4) 

Here Si is the sequence of monomers in the i-th 5-tuple of monomers in the 
polymer, Tj is the conformation of the i-th 5-tuple, the function $i is equal 
to one for the conformation 1 at Fig. 1 and to zero for all other conformations 
(i.e. this is a characteristic function of the conformation 1 in the space of 
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Conformation 1 f3aa(3aa[3aa(3^t (38 vertices) 
there exists a continuation with self-intersection 
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Fig. 10 
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conformations of 5-tuples of monomers )□, analogously the function $ 2 is the 
characteristic function of the conformation 2 at Fig. 1 (thus $ = $ x + $ 2 
in formula (EJ). The #a{Si) denotes the number of monomers of the kind 
A in the z-th 5-tuple of monomers (thus < #A{Si) < 5), e is a positive 
parameter, < e < 1. The analogous notations are used for #s(5'i) - the 
number of monomers of the kind B in the z'-th 5-tuple. 

With the above choice of energy a lattice polymer which contains only 
monomers B will have the minimum of energy in the conformation equal to 
a single /3-strand, since a /3-strand contains only conformations 1 , 1 of 
5-tuples of monomers. 

Analogously a lattice polymer which consists only of monomers A will 
maximize the number of conformations if, *2 of 5-tuples in the conformation 
of the polymer. This can be achieved at lattice a-helix (in an a-helix a 
half of conformations of 5-tuples of monomers are of the type 1 and a half 
are of the type 2). The term e$i(Fj) in expression @ for energy makes 
the conformations 1 , 1 more profitable energetically than a conformation 
which is neither of the type 1 nor of the type 2. 

Therefore a lattice heteropolymer with the energy (J3J) for some sequences 
S of monomers A and B will possess a native tertiary structure — a con- 
formation which is uniquely defined by the sequence S of monomers (the 
primary structure of the polymer), minimizes the energy (jlj) and consists of 
a combination of secondary structures (lattice a-helices and /3-strands). 

Remark We do not claim that for an arbitrary sequence S of monomers A 
and B for a lattice polymer with energy (jlj) there exists a unique native struc- 
ture. Real copolymers of amino acids also possess native tertiary structures 
only for some sequences of amino acids. 

5 Interaction of secondary structures 

Let us discuss the following generalization of the introduced in the present 
paper model of lattice polymers. For this generalization the expression for 
energy of a lattice polymer is given by the following linear combination 

E = E 5 + XE 2 , (5) 

*As in section 2 we consider the conformations modulo lattice translations and rota- 
tions. 
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where A > is a small parameter, E 2 is given by and E§ is given by (E}. 

Let us consider the conformation of a lattice polymer in the form of 
sufficiently long lattice a-helix. If A = then this conformation will be a 
minimum of energy ([5]) . 

Let us consider also the conformation which has the form of two shorter 
parallel a-helices in contact (i.e. the distance between the helices is equal to 
one). This conformation may be considered as an a-helix folded in half. If 
A is larger than some threshold then the energy of a folded in half a-helix 
will be larger than the energy of a single longer a-helix since the energetic 
gain will contain contributions fl2]) of contacts of a large number of monomers 
in the two a-helices, and energetic loss will come from contributions (JH]) of 
small number of 5-tuples of monomers in the area where a-helix was folded 
in half. 

We have obtained the tertiary structure of a polymer — a compact globule 
built of secondary structures. Let us note that in this model it is important 
that the contributions to the energy of conformation from cooperative inter- 
action ([3]) (which generates secondary structures) is larger than the energy 
of contacts of monomers in ([2]). 

6 Conclusion 

We construct a model of a lattice polymer (the quinary lattice model) which 
describes secondary structures of proteins. In this model the energy of a 
conformation of a polymer is equal to a sum of energies of conformations of 
segments of the polymer chain of the length five. 

We show that for this model with cooperative interaction all conforma- 
tions with minimal energy are combinations of lattice models of alpha-helix 
and beta-strand. We show that for lattice polymers of the length not longer 
that 38 monomers we can describe all conformations with minimal energy. 

Models of energy of lattice polymers where contributions from contacts 
of several monomers were taken into account were discussed in the literature 
[2], but dependence on conformations of short segments of a polymer was 
not considered. The corresponding collective contributions were discussed 
as corrections to the contact interaction ([2]) of monomers. In our model 
(J3j) the cooperative interaction and conformational dependence of energy of 
interaction are principal. 

We do not pretend that the model proposed in the present paper gives 
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a realistic description of conformations of proteins. Our aim was to give 
a qualitative demonstration of the importance of conformationally depen- 
dent cooperative interactions at short segments of a protein for creation of 
secondary structures. 
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